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Abstract. We study a class of directed random graphs. In these graphs, the interval 
[0,:r] is the vertex set, and from each y £ [0, x], directed links are drawn to points in 
the interval (y, x] which are chosen uniformly with density one. We analyze the length 
of the longest directed path starting from the origin. In the x — > oo limit, we employ 
traveling wave techniques to extract the asymptotic behavior of this quantity. We also 
study the size of a cascade tree composed of vertices which can be reached via directed 
paths starting at the origin. 



PACS numbers: 02.50.Cw, 92.20.jq 
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1. Introduction 

A random graph is a set of vertices that are connected by random links [U [2J El HI El El U\ ■ 
Random graphs underlie numerous natural phenomena ranging from polymerization 
P E] to the spread of infectious diseases [TU], and they also have applications to 
transportation systems, electrical distribution systems, the Internet, the world-wide 
web, social networks, etc. [HI [121 02] • 

In random graph models, links are usually treated as undirected. In a growing 
number of applications, however, directionality plays a prominent role. One example is 
modeling of the web growth [JJJ [T51 HH1 [T21 [J3] . In modeling of food webs directionality 
(reflecting predation) is even more crucial. Food webs are directed graphs with vertexes 
{0, 1, . . . , m} labeling different species. The presence of the directed link indicates 
that species i is eaten by species j. Usually in food web only links with i < j are 
allowed. (Loops (i, i) which would account cannibalism are ignored; the directed link 
with i > j could e.g. represent predation on the young of the 'stronger' species i 
by adults of species j, but such links are also disregarded in most models.) The simplest 
cascade model [TTJ, [TBI EE EDI EB [22] generates a food web at random, namely for each 
pair of species % and j with i < j the directed link (i,j) is drawn at random with a 
certain predation probability c. A number of questions, particularly those related to 
the maximal length of food chains, have been investigated in the framework of the this 
cascade model. For instance, what is the length (the number of links) of the longest 
direct path starting from the basal species (vertex 0)? A dual question concerns the 
length of the longest path finishing at the top species (vertex m). One can also ask 
about the length M of the longest path irrespectively on the first and last species. 

The simplest cascade model is a kind of 'standard model' in the subject, and it had 
been widely used to interpret ecological data on community food webs [18] . The standard 
cascade model provides a very natural mechanism for generating directed random graphs 
and the same model has been suggested in other contexts, e.g. as a model of parallel 
computation [231121] in which the presence of the directed link with i < j indicates 
that task i must be performed before task j. For a parallel computation in which each 
task takes a unit of time, the processing time will be equal M + l (where M is the length 
of the the longest path). 

Food webs typically involve a huge number of specie^fj while the average predation 
per species is usually not too large. Hence it is interesting to investigate large food webs 
with small predation probability, more precisely the scaling limit 

m — > oo, c — > 0, cm = x = finite (1) 

This suggests to study a continuum cascade model where the vertex set is the interval 
[0,a;]. For each species y, the number of predator species is random, such species are 
chosen at random from the interval (y, x] according to the Poisson distribution with unit 
density. The Poisson distribution immediately follows from the binomial distribution 

| Small food webs tend to reflect our ignorance rather than reality. 
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(characterizing the discrete cascade model) in the scaling limit ([T]). This cascade model 
is the minimalist continuum model of directed random graphs. Simple models tend 
to arise in various unrelated subjects and they are interesting on purely intellectual 
grounds. Nevertheless, for concreteness in the following exposition we shall often use 
the language of food webs. 

The rest of this article is organized as follows. In Sec. [2] we define the model, discuss 
its simplest properties, and derive a recurrence for the longest directed path starting 
from the origin. The asymptotic behavior of the solution to that recurrence is analyzed 
in the following sections [3] and |4} In section [5] we discuss the total number of vertices in 
a cascade tree with the root at the origin; on the language of food webs it counts the 
basal species and species feeding on it, both directly and indirectly. 



2. Continuum Cascade Model 



The vertex set of our random graph is the interval [0,a;]. In the illustrative picture 
below we draw only the vertex set and links from the cascade subgraph initiating at the 
origin (the open circle on the picture). Namely, we draw all links emanating from the 
origin indicating direct predation on the basal species (there are 3 such predators in the 
picture); then we draw all the links from these direct predators (4 such predators in the 
picture); etc. Links are drawn in a cascade manner thereby explaining the name of the 
model. 




Overall, in the above illustrative picture the cascade subgraph is a tree with 10 
links and 11 vertices. Six of these vertices (closed circles on the picture) are terminal, 
that is, there are no links emanating from them. Every cascade subgraph is a tree; the 
size and the number of terminal vertices in cascade trees fluctuate from realization to 
realization. 

Terminal vertices represent top predators on the language of food webs. It is easy 
to compute the fraction of top predators: 

1 r x 1 - e~ x 

T = - dye~y= 1 -^- (2) 
X Jo X 

The fraction of bottom preya^J that is, species who do not eat other species, is the same. 



§ Bottom preys are often called basal species. We reserve the term 'basal species' only for the species at 
the origin which, according to the definition of the continuum cascade model, can never be a predator 
independently on the choice of links. 
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Figure 1. The cascade tree with the basal species (the vertex at the top) playing the 
role of the root. The height of this cascade tree is equal to 3. 



The overlap of the sets of top predators and bottom preys (one can call them neutral 
species) is non-empty, the fraction of neutral species is 

N = 1 f dy e~ y e~ {x - y) = e~ x (3) 
x Jo 

We now turn to more subtle properties of the continuum cascade model which 
are related to the cascade tree. This tree is finite and it varies from realization to 
realization; accordingly, the properties of the cascade tree are probabilistic. To define 
these properties it is convenient to utilize a more traditional way of plotting trees; the 
cascade tree pictured above is presented on Fig. [T] This figure resembles binary search 
trees and both the relevant properties of binary search trees and the methods used in 
analysis of binary search trees ^3 |2Sl EZ1 12HI 123 EDI Ell E21 E31 Ell 133 ESI are useful 
in our situation. For instance, the height of the binary search trees has attracted a lot 
of attention, and a traveling wave analysis [33^ [MJ E3] has provided a very efficient way 
of tackling the asymptotic (in number of vertices of the tree) behavior of the height. 
In the present problem, the height is indeed an interesting quantity, namely it is the 
length of longest chain from the basal species to the bottom of the cascade tree, and 
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the traveling wave analysis will be helpful as well. 

We now establish a recurrence relation for the height distribution. The height is a 
non-negative integer. It is convenient to work with the cumulative distribution 

P n (x) = Prob(height < n) (4) 

The basal species is the terminal vertex with probability e~ x , and therefore 

Po(x) = e~ x (5) 



For n > 1, 



k\ Jo Jo xx 



k>0 



k 



where the first line accounts for any possible number k > of links emanating from 
the origin and finishing at all possible points x — yj. There is no 'interaction' between 
different branches of the cascade tree, so we merely must assure that all cascade trees 
originating at x — yj have heights not exceeding n— 1. Computing the sum in the above 
equation we arrive at our main recurrence 

P n (x) = exp -x+ f P n -i(y)dy) (6) 

Starting with (j5]) we find 

Pi(x) = exp -x + 1 - e~ x . (7) 

One can recursively determine P2, then P3; analytical expressions for P n become very 
cumbersome as n increases. Fortunately, in the large 'time' limit, n>l, the behavior 
greatly simplifies, namely the solution acquires a traveling wave form [see Fig. |2), 

p n (x)->n(0, Z = x-x f , (8) 

with the front position growing linearly with 'velocity' equal to e -1 : 

1 

Xf ~ vn, v = - (9) 

The traveling wave profile n(£) decreases monotonically from 1 to as £ increases 
from —00 to 00. More precisely, 

) oc e~* when £ 00 (10) 

and 

l-n(£)oce eC when £ -> -00 (11) 

In the next section we give an elementary argument which allows one to understand ([9]). 
A more comprehensive traveling wave analysis that leads to above results is presented 
in section HI 
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Figure 2. The distribution P n (x) versus x obtained by iteration the recurrence (j6J). 
Shown is P n (x) for n = 20,40,60,80 (left to right). Iterations were performed using 
Mathematica. The observed velocity of the traveling wave is in a good agreement with 
the theoretical prediction ([£]). 



3. Elementary derivation of the traveling wave velocity 



Let us begin with the behavior of P n (x) for small x. Expanding Pq(x) and Pi(x), see 
equation (J7|, we obtain 



• Is 

1 - x + -x - -x 6 + . . . , Pi 



1 - -x 2 + -x 3 + —x 4 + . 
2 6 12 



2 6 

Using equation ^ we recurrently determine the expansions of the following P n to yield 

1 



1 - 



n+1 



(12) 



(13) 



(n + 1)! 

This result is easy to prove by induction. One can continue this expansion, e.g., 

Pn = 1 - 7 1 „ X n+1 + , 1 ,, X n+2 + , 2 x , X n+3 + . . . 

(n + 1)! (ra + 2)! (n + 3)! 

is valid for all n > 1; this is also easily proven by induction. 

Let us now estimate the front position Xf from the criterion P n (xf) = \. Keeping 
only two terms as in equation (12) we obtain x^ +1 = |(n + 1)!, or xj = in the 
leading order. What will happen if we keep e.g. four terms in the expansion? Using the 
same criterion P n (xf) — \ in conjunction with equation (13) we get 

1 



x 



n+1 
/ 

(n + 1)! 



1 - 



x f 



2x) 



n 



(n + 2)(n + 3)_ 

In the leading order we recover the previous prediction Xf = This does not prove 
equation ([9]), but at least it shows its consistency with the series (13). 
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4. Traveling wave analysis: Velocity selection 
We want to understand the behavior of the recurrence 



Pn+i(x) 



exp 



-x+ I dyP n (y) 



(14) 



when x ^> 1. We assume the convergence to traveling wave solution and the validity 
of the traveling wave ansatz (J8|. Numerical results strongly support this assumption 
[see Fig. |2) and show that the convergence is rather fast, that is, the asymptotic shape 
emerges already for not to large n. We also assume that Xf = vn (for large n), but we 



do not specify v. The left-hand side of equation (14) becomes 

p n+1 {x) = n(£ - v) 

while the right-hand side of equation (fl4|) turns into 



exp 



— x + 



drj n(r/) 



r] = y — nv 



(15) 



(16) 



We will allow |£| to be large, but we will always assume that |£| <C nv. In this situation, 
the integral in equation (jT6j) can be simplified as follows: 

rt [0 ft 

dr)H[r)) — / ^77 11(77) + / ^11(77) 

v J —nv JO 

= nv+ dr] [11(77) - 1] + / dr] 11(77) 

J —nv Jo 

= nv + ( dr] 11(77) — L + L n , 
Jo 



(17) 



where we have used the shorthand notation 
L [° d77[l-n(77)], L n = 



d77[i-n(77)]. 



Since 11(77) quickly approaches to 1 as 77 — > —00, we drop L n from (17); we shall justify 



this step a posteriori. Combining (16) and (17) we see that the right-hand side of (14) 
becomes 



exp 



-Z-L + [ dr]U(r]) 
Jo 



Equating (15) and (19) we arrive at 

n(^ - v) = exp - L + / dr] IlOn) 

[ ' Jo 

This is the governing equation for n(£). 
4-1. Far ahead of the front: £ — > 00 



(19) 



(20) 



Since IT(£) quickly approaches to zero as £ — > 00, equation (20) gives n(£— v) ~ e * L+R 



where R = J °° dr]U(r]). Thus we confirm (10); more precisely we get 
n(£) ~ e R - L ~ v e^ when £ ->■ 00. 



(21) 
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4-2. Far behind the front: £ — > — oo 
It is more convenient to work with $(£) 



1 — Il(£) rather than II(£). In terms of 



equation (20) becomes 



cxp 



ft 

-L - / dr]§(ri) 
Jo 



(22) 



From the definition (18) we see that 
L = di]§(ri). 

J — oo 

Therefore we can re-write (|22l) as 



exp 



dr] &(r]) 



(23) 



Equation (23) is equivalent to Eq. (20), we have not made any approximation. Turning 



to the £ — > — oo limit we note that in this regime $ — > and hence we can expand the 



exponent on the right-hand side of equation (23). Keeping only two terms we simplify 



equation (23) to 



dt] $(77). 



(24) 



From (24), or from equation $'(£ — v) — obtained by differentiating of Eq. (24) 



we see that the solution has an exponential form 
$(£) = De a t 



Plugging (25) into (24) we arrive at the dispersion relation 

ae~ av = 1 



(25) 



(26) 



An elementary analysis of this equation indicates that solutions exist only when v < e -1 . 
We now invoke the selection principle which asserts that the extremal value, v = e -1 in 
our case, is realized. 

Traveling wave solutions have been investigated in the context of partial differential 
equations. A few partial differential equations admitting traveling wave solutions have 
been deeply studied. One such equation is the celebrated Fisher-KPP equation [381 EH] 
for which the selection principle had been proven [211 HO] for sufficiently steep initial 
conditions. (For more recent work see e.g. [1U H2J H3j. A very comprehensive review 
of traveling wave solutions of non-linear partial differential equations has been given 
by van Saarloos [Sj, a lighter expositions appear in books [321 HH1 [9].) More recently, 
traveling wave solutions have been investigated in the context of nonlinear recurrences 
arising in the analyses of binary search algorithms [321 EH ES] , kinetic theory [47] , and 
other problems [HI HH1 [5Q1 [51]; see [521 [53] for a review of the applications of traveling 
wave techniques to recurrences. 

Asymptotically, the wave front advances at a constant velocity v = e _1 . The 
approach to this asymptotic value is rather slow, namely there is a n~ l correction in 
the leading order, resulting in a logarithmic correction to the front position. This 
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correction was first established by Bramson [40J for the Fisher-KPP equation; it was 
subsequently generalized [HI H21 H3j HI] to more general partial differential equations 
and to recurrences (33J, EH [35J HHJ 09]. This correction generally has the form ^ Inn. 
For the selected velocity v = e -1 , the decay amplitude a = e is implied by dispersion 



relation (26). Taking into account this logarithmic correction we get 



3 

Xf = e~ l n+ — lnn + 0(1) (27) 

It was convenient to think about x and n as space and time coordinates, so that 
the front of the traveling wave was advancing and we determined Xf — Xf(n). In the 
original problem, the parameter x is fixed and we are interested in the height H(x) of 
the cascade tree. The height is essentially the inverse to Xf = Xf(n) which is taken 
when Xf = x. Thus 

H(x) = ex- ^ lnx + C(l) (28) 



The height is of course a random quantity. Equation (28) gives the average height. 
In the x — > oo limit, the average provides a faithful description as it is a growing 
quantity while the variance remains finite. We haven't proved this assertion, but at 
least on the physical level of rigor it is obvious: The probability distribution Pn(x) has 
asymptotically a traveling wave shape with the width of the front remaining finite, and 
this is essentially equivalent to the finite width of the height distribution. 

5. Size of the Cascade Tree 

The size S(x) of the cascade tree, that is, the total number of vertices in the tree, is a 
random variable. Let us compute the average size (S(x)). From the definition of the 
continuum cascade model we deduce 

N ; '•' dy x dy k 



1+ / dy{S(y)) (29) 







Differentiating (29) we obtain = (S), from which 



(3(x)) = e x (30) 
A similar line of reasoning leads to an integral equation for the second moment 

rxd y ,cnt u , xk ~x, r d y 



x 



(%)> 



oo „fc 



~ x k _ x fc(fc-l) r* d Vl [* dy 2 
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Using (30) we simplify above integral equation to 

(S 2 (x)) = 1 + f dy (S(y)) + 2(e* - 1) + (e» - l) 2 



(S 3 (x))= 1 ^e 3 *- 1 ±e*- 3 2 xe* (32) 



which is solved to yield 

(S 2 (x)} = 2e 2x - e x (31) 
One can continue and compute 

<S 3 (*)> = ^e 3 *-^ 

and a few higher moments (S p (x)), but results quickly become very cumbersome. The 
explicit results (30)-(32) show that, in contrast to the height, the size of the cascade 
tree is the random quantity whose limiting distribution (in the x — > oo limit) remains 
broad. More precisely, in the scaling limit 

x — > oo, S — > oo, a = e~ x S = finite 
the size distribution becomes 

Prob[6» = S] = e~ x F(a) 
with the limiting distribution being different from the delta function, F(o~) ^ 8{a — 1). 
The normalization requirement together with (30)-(32) and similar equations for 
higher moments (S p (x)) show that the moments M p = J^°daa p F(a) of the limiting 
distribution are 

M = M 1 = 1, M 2 = 2, M 3 = ^, ^4 = y, M 5 = 25, 

etc. 



6. Summary 

We proposed a minimalist model of infinite directed random graphs. The model is a 
continuum version of a model of finite directed random graphs, known as the cascade 
model, which has been investigated in the context of food webs and parallel computation. 

Our model presumes a total order on the set of vertices. We chose the simplest 
such set, an interval of length x. From each y e [0, x], directed links to points y' > y are 
drawn at random according to the Poisson distribution, that is, the points y' G (y, x] 
are chosen independently from each other and uniformly with density one. The analysis 
of this continuum cascade model is actually simpler than the analysis of the discrete 
cascade model. This is demonstrated by studying the distribution of the length of the 
longest directed paths starting at the origin (equivalently, the height of the cascade tree 
with the root at the origin). We employed traveling wave techniques to extract the 
asymptotic behavior of the length of the longest directed paths in the x — > oo limit. It 
will be interesting to understand the limiting distribution of the size of the cascade tree 
with the root at the origin as well as other properties of the continuum cascade model. 
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